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This invention relates to polymorphisms in the human P2X7 gene and corresponding 
novel allelic polypeptides encoded thereby. The invention also relates to methods and 
materials for analysing allelic variation in the P2X7 gene, and to the use of P2X7 



polymorphism in treatment of diseases with P2X7 drugs. 

The P2X7 receptor (previously known as P2Z receptor), which is a ligand-gated ion 
channel, is present on a variety of cell types, largely those known to be involved in the 
inflanimatory/immune process, specifically, macrophages, mast cells and lymphocytes 

1 0 (T and B). Activation of the P2X7 receptor by extracellular nucleotides, in particular 

adenosine triphosphate, leads to the release of interleukin- 1 P (IL- 1 p) and giant cell formation 
(macrophages/microglial cells), degranulation (mast cells) and L-selectin shedding 
(lymphocytes). ?2X-j receptors are also located on antigen-presenting cells (APC), 
keratinocytes, sahvary acinar cells (parotid cells) and hepatocytes. Compounds acting at the 

15 P2X7 receptor are therefore indicated as pharmaceuticals for use in the treatment of 

rheumatoid arthritis, osteoarthritis, psoriasis, allergic dermatitis, asthma, chronic obstructive 
pulmonary disease (COPD), hyperresponsiveness of the airway, septic shock, 
glomerulonephritis, iaitable bowel disease, Crohn's disease, ulcerative colitis, atherosclerosis, 
growth and metastases of malignant cells, myoblastic leukaemia, diabetes, Alzheimer's 

20 disease, meningitis, osteoporosis, bum injury, ischaemic heart disease, stroke and varicose 
veins. For further background, the reader is referred to the following articles: North and 
Barnard in Current Opinion in Neurobiology 1997, 7, 346-357; Rassendren, JBC, 1997, 273, 
5482-6; and Buell, Receptors and Channels, 1998, 5, 347-354. The terms P2X„ V2X, receptor 
and P2RX7 are used interchangeably herein. 

25 All positions herein of polymorphisms in the 5' UTR region of the P2X7 

polynucleotide relate to the position in SEQ ID NO 1 unless stated otherwise or apparent from 
the context. 

All positions herein of polymorphisms in the exon regions of the PIX-^ polynucleotide 
relate to the position in SEQ ID NO 2 unless stated otherwise or apparent from the context. 
30 All positions herein of polymorphisms in the intron regions of the P2X7 

polynucleotide relate to the position in SEQ ID NO 3 unless stated otherwise or apparent from 
the context. . ^ 
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All posiiions herein of polymorphisms in the F2X^ polypeptide relate to the position in 
SEQ ID NO 4 unless stated otherwise or apparent from the context. 

One approach is to use knowledge of polymorphisms to help identify patients most 
suited to therapy with particular pharmaceutical agents (this is often termed . 
5 "pharmacogenetics") . Pharmacogenetics can also be used in pharmaceutical research to 
assist the drug selection process. Polymorphisms are used in mapping the human genome and 
to elucidate the genetic component of diseases. The reader is directed to the following 
references for background details on pharmacogenetics and other uses of polymorphism 
detection: Linder et al. (1997), Clinical Chemistry, 43, 254; Marshall (1997), Nature 
10 Biotechnology, 15, 1249; International Patent Application WO 97/40462, Spectra 
Biomedical; and Schafer e/ a/, (1998), Nature.Biotechnology, 16, 33. 

Clinical trials have shown that patient response to treatment with pharmaceuticals is 

often heterogeneous. Thus there is a need for improved approaches to pharmaceutical agent 
design and therapy. 

15 Point mutations in polypeptides will be referred to as follows: natural amino acid 

(using I or 3 letter nomenclature) , position, new amino acid. For (a hypothetical) example 
"D25K" or **Asp25Lys" means that at position 25 an aspartic acid (D) has been changed to 
lysine (K). Multiple mutations in one polypeptide will be shown between square brackets 
with individual mutations sepaAted by commas, 

20 The present invention is based on the discovery of polymorphisms in P2X7- In 

particular, we have found thirty polymorphisms in the coding sequence of the P2X7 gene, 12 
of which lead to changes in the sequence of expressed protein. 

According to one aspect of the present invention there is provided a method for the 
diagnosis of a polymorphism in P2X7 in a human, which method comprises determining the 
25 sequence of the human at at least one polymorphic position and determining the status of the 
human by reference to polymorphism in P2X7. Preferred polymorphic positions are one or 
more of the following positions: 

positions 936, 1012, 1 147, 1343 and 1476 in the 5'UTR region of the P2X7 gene as defined by 
the position in SEQ ID NO: 1; 

30 positions 253, 488, 489, 760, 835, 853, 1068, 1096, 1315J324, 1405, 1448^49^^ 

1628 and 1772 in the coding region of the P2X7 gene as defined by the position in SEQ ID 
NO:2;md 
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positions 4780, 4845, 4849, 5021, 5554, 5579, 5535, 5845 and 6911 in the intron region of the 
P2X7 gene as defined by the position in SEQ ID NO: 3; 

positions 76 ,155, 245, 270, 276, 348, 357, 430, 433, 460, 490 and 496 in the P2X7 
polypeptide as defined by the position in SEQ ID NO: 4. 
5 The teim human includes both a human having or suspected of having a P2X7 

mediated disease and an asymptomatic human who may be tested for predisposition or 
susceptibility to such disease. At each position the human may be homozygous for an allele 
or the human may be a heterozygote. 

The term polymorphism includes single nucleotide substitution, nucleotide insertion 
1 0 and nucleotide deletion which in the case of insertion and deletion includes insertion or 
deletion of one or more nucleotides at a position of a gene and corresponding alterations in 
expressed protein. 

In one embodiment of the invention preferably the method for diagnosis described 
herein is one in which the polymorphism in the in the 5 'UTR region of the P2X7 gene as 
15 defined by the position in SEQ ID NO: 1 is any one of the following: 

at position 936 is presence of C and/or A; at position 1012 is presence of T and/or C; 

at position 1 147 is presence of A and/or G; at position 1343 is presence of G and/or A; and 

at position 1476 is presence of A and/or G. 

In one embodiment of the invention preferably the method for diagnosis described 
20 herein is one in which the polymorphism in the coding region of the P2X7 gene as defined by 
the position in SEQ ID NO: 2 is any one of the following: 

at position 253 is presence of T and/or C; at position 488 is presence of G and/or A; 

at position 489 is presence of C and/or T; at position 760 is presence of T and/or G; 

at position 835 is presence of G and/or A; at position 853 is presence of G and/or A; 
25 at position 1068 is presence of G and/or A; at position 1096 is presence of C and/or G; 

at position 1315 is presence of G and/or G; at position 1324 is presence of C and/or T; 

at position 1405 is presence of A and/or G; at position 1448 is presence of C and/or T; 

at position 1494 is presence of A and/or G; at position 1513 is presence of A and/or C; 

at position 1628 is presence of G and/or T; and at position 1772 is presence of G and/or A. 
30 In one embodiment of the invention preferably the method for diagnosis described 

herein is one in which the polymorphism in the intron region of the P2X7 gene as defined by 

the positfon in SEQ ID NO: 3. is any one of the following: " 
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at position 4780 is presence of C and/or t; at position 4845 is presence of C and/or T; 
at position 4849 is presence of A and/or C; at position 5021 is presence of T and/or C; 
at position 5554 is presence of 3 and/or 4 repeats of GTTT (wherein position 5554 refers to 
the position of the G in the first unit repeat); 

at position 5579 is presence of G and/or C; at position 5535 is presence of A and/or T; 
at position 5845 is presence of C and/or T; and at position 691 1 is presence of T and/or C. 

In one embodiment of the invention preferably the method for diagnosis described 
herein is one in which the polymorphism in the F2Xy protein as defined by the position in 
SEQ ID NO: 4. is any one of the following: val76ala, his I55tyr, val245gly, arg270his, 
arg276his, ala348tyr, thr357ser, pro430arg, ala433val, gln460arg, ser490gly and glu496ala. 

The method for diagnosis is preferably one in which the sequence is determined by a 
method selected from amp Ufication refractory mutation system, restriction fragment length 
polymorphism and primer extension. 

The status of the individual may be determined by reference to allelic variation at any 
1,2,3,4,5,6,7; 8,9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20,21, 22,23, 24,25, 26, 27,28, 
29, 30 or more positions. 

The test sample of nucleic acid is conveniently a sample of blood, bronchoalveolar 
lavage fluid, sputum, or other body fluid or tissue obtained from an individual. It will be 
appreciated that the test sample may equally be a nucleic acid sequence corresponding to the 
sequence in the test sampla^jthat is to say that all or a part of the region in the sample nucleic 
acid may firstly be amplified using any convenient technique e.g. PGR, before analysis of 
allelic variation. - 

It will be apparent to the person skilled in the art that there are a large number of 
analytical procedures which may be used to detect the presence or absence of variant 
nucleotides at one or more polymorphic positions of the invention. In general, the detection 
of allelic variation requires a mutation discrimination technique, optionally an amplification 
reaction and optionally a signal generation system. Table 1 lists a number of mutation 
detection techniques, some based on the PGR. These may be used in combination with a 
number of signal generation systems, a selection of which is listed in Table 2. Further 
amplification techniques are listed in Table 3. Many current methods forlhe detection of 
allelic variation are reviewed by Nollau e( ai, Clin. Chem. 43, 1 114-1 120, 1997; and in 
standard textbooks, for example "Laboratory Protocols for Mutation Detection'', Ed. by U. 
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Landegren, Oxford University Press, 1996 and "PGR", 2"*' Edition by Newton & Graham, 
BIOS Scientific Publishers Limited, 1997. 



Abbreviations: 





Amolificsition refractory imitation <;v<!fpm Hn^ar pYt^^ncinn 


APEX 


Arrayed primer extension 


ARMS™ 


Amolificaticn refractorv miit;?tinn w<3t-pm 


b-DNA 


Branched DNA 


bp 


base pair 


CMC 


Chemical mismatch cleavage 


COPS 


Competitive oligonucleotide nriminp <?v<5tem 


DGGE 


Denaturing gradient eel electroDhoresi<^ 


ELISA 


Enzyme Linked Immunosorbent Assay 


FRET 


Fluorescence resonance enerpv tranc:fpr 


LCR 


Li ffase chain reaction 


MASDA 


Multinle allele snecific diacmnQtir* 5?<2ct»v 


NASBA 


Nucleic acid sequence based amplification 


OLA 


Oligonucleotide ligation assay 1 


PGR 


Polymerase chain reaction 


PTT 


Protein tmncation test 


RFLP 


Restriction fragment length polymorphism 


SDA 


Strand displacement amplification 


SNP 


Single nucleotide polymorphism 


SSCP 


Single-strand conformation polymorphism analysis 


SSR 


Self sustained replication 


TGGE 


Temperature gradient gel electrophoresis 



5 Table 1 - Mutation Detection Techniques 

Genera!: DNA sequencing, Sequencing by hybridisation 

Scanning: PTT*, SSCP, DGGE, TGGE, Cleavase, Heteroduplex analysis, CMC, Enzymatic 
mismatch cleavage 

* Note: 'not useful for detection of promoter polymorphisms. * 
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Hybridisation Based 

Solid phase hybridisation: Dot blots, MASDA, Reverse dot blots, Oligonucleotide 
arrays (DNA Chips), 

Solution phase hybridisation: Taqman^M - US-52 10015 & US-5487972 (Hoffinann-La 
5 Roche), Molecular Beacons - Tyagi et al (1996), Nature Biotechnology, 14, 303; WO 
95/13399 (Public Health Inst., New York) 

Extension Based: ARMS*^^, ALEX*^^ - European Patent No. EP 332435 B 1 (Zeneca 
Limited), COPS - Gibbs et al (1989), Nucleic Acids Research, 17, 2347. 
Incorporation Based: Mini-sequencing, APEX 
10 Restriction Enzyme Based: RFLP, Restriction site generating PCR 
Ligation Based: OLA 
Other: Invader assay 

Table 2 - Signal Generation or Detection Systems 

Fluorescence: FRET, Fluorescence quenching. Fluorescence polarisation - United Kingdom 
15 Patent No. 2228998 (Zeneca Limited) 

Other: Chemiluminescence, Electrochemiluminescence, Raman, Radioactivity, Colorimetric, 
Hybridisation protection assay, Mass spectrometry 
Table 3 - Further Amplification Methods 

SSR, NASBA, LCR, SDA, b-DNA ' 
20 Table 4- Protein variation detection methods 
Immunoassay 
Immunohistology 
Peptide sequencing 

Preferred mutation detection techniques include ARMS™, ALEX*^^, COPS, Taqman, 
25 Molecular Beacons, RFLP, and restriction site based PCR and FRET techniques. 

Immunoassay techniques are known in the art e.g. A Practical Guide to ELIS A by D M 
Kemeny, Pergamon Press 1991; Principles and Practice of Immunoassay, 2"^^ edition, CP 
Price & D J Newman, 1997, published by Stockton Press in USA & Canada and by 
Macmillan Reference in the United Kingdom. 
30 Particularly preferred methods include ARMS™ and RFLP based methods. ARMS™ 

is an especially preferred method. 
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In a further aspect, the diagnostic methods of the invention are used to assess the 
pharmacogenetics of a drug acting at P2X7. 

Assays, for example reporter-based assays, may be devised to detect whether one or 
more of the above polymorphisms affect transcription levels and/or message stability. 

Individuals who carry particular allelic variants of the P2X7 gene may therefore exhibit 
differences in their ability to regulate protein biosynthesis under different physiological 
conditions and will display altered abihties to react to different diseases. In addition, 
differences arising as a result of allelic variation may have a direct effect on the response of an 
individual to drug therapy. The diagnostic methods of the invention may be useful both to 
predict the clinical response to such agents and to determine therapeutic dose. 

In a further aspect, the diagnostic methods of the invention, are used to assess the 
predisposition and/or susceptibility of an individual to diseases mediated by P2X7. This may 
be particularly relevant in the development of hyperlipoproteinemia and cardiovascular 
disease and the present invention may be used to recognise individuals who are particularly at 
risk from developing these conditions. 

In a further aspect, the diagnostic methods of the invention are used in the 
development of new drug therapies which selectively target one or more allelic variants of the 
P2X7 gene. Identification of a |ink between a particular allelic variant and predisposition to 
disease development or response to drug therapy may have a significant impact on the design 
of new drugs. Drugs may be designed to regulate the biological activity of variants implicated 
in the disease process whilst minimising effects on other variants. 

In a further diagnostic aspect of the invention the presence or absence of variant 
nucleotides is detected by reference to the loss or gain of, optionally engineered, sites 
recognised by restriction enzymes. 

According to another aspect of the present invention there is provided a human P2X7 
gene or its complementary strand comprising a variant allelic polymorphism at one or more of 
positions defined herein or a fragment thereof of at least 20 bases comprising at least one 
novel polymorphism. 

Fragments are at least 17 bases, more preferably at least 20 bases, more preferably at 
least 30 bases. 
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According to another aspect of the present invention there is provided a polynucleotide 
comprising at least 20 bases of the human P2X7 gene and comprising a polymorphism 
selected from any one of the following: 



Region 


Polymorphisia 




SEQ ID NO: 1 


5'UTR 


93 6 C-4A 




1012 T->C 




1147 A->G 




1343 G-^A 




1476 A->G 




Region 


Polymorphism 




SEQ ID NO: 2 


exon 2 


253 T->C 


exon 5 


488 G->A 




489 C->T 


exon 7 


760 T~>G 


exon 8 


835 G->A 




853 G->A 


exon 11 


1068 G^A 




1096 C->G ji 


exon 12 


1315 C->G 


exon 13 


1324 C->T 




1405 A^G 




14 4 8 C~>T 




14 94 A->G 




1513 A->C 




1628 G— >T 




1772 G->A 




Region 


Polymorphism 




SEQ ID NO; 3 


intron E 


4780 C->T 




4 84 5 C-^T 




4 84 9 A-+C 


int ron F 


5021 T->C 




5554 (GTTT) n=3, 4 
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5579 






553 5 


A->T 


x.ntron G 


5845 


C->T 




6911 


T->C 



According to another aspect of the present invention there is provided a polynucleotide 
comprising at least 20 bases of the human P2X7 gene and comprising an allelic variant 
selected from any one of the following: 



Region 


Variant 

SEQ ID NO: 1 


S'UTR 


936 A 




1012 C 




1147 G 




134 3 A 




14 76 G 



Region 


Variant 




SEQ ID NO: 2 


exon 2 


253 C 


exon 5 


488 A 




489 T 


exon 7 


76 0 G 


exon 8 


835 A 




853 A 


exon 11 


1068 A 




1096 G 


exon 12 


1315 G 


exon 13 


1324 T 




1405 G 




1448 T 




1494 G 




1513 C 




1628 T 




1772 A 



Region 


Variant 

SEQ ID NO: 3 


intron E 
* 


4780 T 
4 84 5 T 
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4849 C 


intron F 


5021 C 

55S4 {GTTT}„,n:r4 
S579 C 
5535 T 


intron G 


5845 T 
6911 C 



According to another aspect of the present invention there is provided a human P2X7 
gene or its complementary strand comprising a polymorphism, preferably corresponding with 
one or more the positions defined herein or a fragment thereof of at least 20 bases comprising 
at least one polymorphism. 
5 Fragments are at least 17 bases, more preferably at least 20 bases, more preferably at 

least 30 bases. 

The invention further provides a nucleotide primer which can detect a polymorphism 

of the invention- 
According to another aspect of the present invention there is provided an allele 
10 specific primer capable of detecting a P2X7 gene polymorphism, preferably at one or more of 

the positions as defined herein. 

An aUele specific primer is used, generally together witb a const^ 

amplification reaction such as a PGR reaction, which provides the discrimination between 

alleles through selective amplification of one allele at a particular sequence position e.g. as 
15 used for ARMS'^^^ assays. The allele specific primer is preferably 17- 50 nucleotides, more 

preferably about 17-35 nucleotides, more preferably about 17-30 nucleotides. 

An allele specific primer preferably corresponds exactly with the allele to be detected 

but derivatives thereof are also contemplated wherein about 6-8 of the nucleotides at the 3 ' 

terminus corrp„<;nnnH with the allele to be detprlRH nnH wh^rp-in un TO <^ii^u o*. nr^ q K a 

J — ._ — — ^ w-w* t« , T vtAX "JJ w ^ ) '^V-'ll C4.J ^Jt^ J ^5 "Tj 

20 2, or 1 of the remaining nucleotides may be varied without significantly affecting the 

properties of the primer. 

Primers may be manufactured using any convenient method of synthesis. Examples of 

such methods may be found in standard textbooks, for example "Protocols for 

Oligonucleotides and Analogues; Synthesis and Properties/' Methods in Molecular Biology 
25 Series; Volume 20; Ed. Sudhir Agrawal, Humana ISBN: 0-89603-247-7; 1993; Edition. If 

required the primer(s) may be labelled to facilitate detection. 
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According to another aspect of the present invention there is provided an allele- 
specific oligonucleotide probe capable of detecting a P2Xj gene polymorphism, preferably at 
one or more of the positions defined herein. 

The allele-specific oligonucleotide probe is preferably 17- 50 nucleotides, more 
5 preferably about 17-35 nucleotides, more preferably about 17-30 nucleotides. 

The design of such probes will be apparent to the molecular biologist of ordinary skill 
Such probes are of any convenient length such as up to 50 bases, up to 4G bases, more 
conveniently up to 30 bases in length, such as for example 8-25 or 8-15 bases in length. In 
general such probes will comprise base sequences entirely complementary to the 
10 corresponding wild type or variant locus in the gene. However, if required one or more 

mismatches may beintroduced, provided that the discriminatory power of the oligonucleotide 
probe is not unduly affected. The probes of the invention may carry one or more labels to 
facilitate detection. 

According to another aspect of the present invention there is provided an allele 
1 5 specific primer or an allele specific oligonucleotide probe capable of detecting a P2X7 gene 
polymorphism at one of the positions defined herein. 

According to another aspect of the present invention there is provided a diagnostic kit 
comprising an allele specific oligonucleotide probe of the invention and/or an allele-sp|cific 
primer of the invention. 

20 The diagnostic kits may comprise appropriate packaging and instructions for use in the 

methods of the invention. Such kits may further comprise appropriate buffer(s) and 
polymerase(s) suchas themiostable polymerases, for example taq polymerase. 

In another aspect of the invention, the polymorphisms of this invention may be used as 
genetic markers in linkage studies. This particularly applies to the polymorphisms of 

25 relatively high frequency. The P2X7 gene is on chromosome 12q24 (Buell et al, Receptors 
and Channels, 1998, 5,347-354). Low ft-equency polymorphisms may be particularly useful 
for haplotyping as described below, A haplotype is a set of alleles fou 

polymorphic sites (such as within a gene) on a single (paternal or mate™^ If 
recombination within the gene is random, there may be as many as 2" haplotypes, where 2 is 
30 the number of alleles at each SNP and n is the number of SNPs. One approach to identifying 
mutations or polymorphisms which are correlated with clinical response is to carry out an 
association study using all the haplotypes that can he identifiedinthc population of interest/ 
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The frequency of each haplotype is limited by the frequency of its rarest allele, so that SNPs 
with low frequency alleles are particularly useful as markers of low frequency haplotypes. As 
particular mutations or polymorphisms associated with certain clinical features, such as 
adverse or abnormal events, are likely to be of low frequency within the population, low 
5 frequency SNPs may be particularly useful in identifying these mutations (for examples see: 
Linkage disequilibrium at the cystathionine beta synthase (CBS) locus and the association 
between genetic variation at the CBS locus and plasma levels of homocysteine, Ann Hum 
Genet (1998) 62:481-90, De Stefano V, Dekou V, Nicaud V, Chasse JF, London J, Stansbie 
D, Humpiiries SE, and Gudnason V; and Variation at the von willebrand factor (vWF) gene 
0 locus is associated with plasma vWF:Ag levels: identification of three novel single nucleotide 
polymorphisms in the vWF gene promoter. Blood (1999) 93:4277-83, Keightley AM, Lam 
YM, Brady JN, Cameron CL, Lillicrap D). 

According to another aspect o f the present invention there is provided a com^^ 
readable medium comprising at least one novel sequence of the invention stored on the 
5 medium. The computer readable medium may be used, for example, in homology searching, 
mapping, haplotyping, genotyping or pharmacogenetic analysis. 

According to another aspect of the present invention there is provided a method of 
treating a human in need of treatment with a drug acting at P2X7 in which the method 

■ ■ ■ i ■ 

comprises: ^■ 
i) diagnosis of a polymorphism in P2X7 in the human, which diagnosis preferably 
comprises determining the sequence at one or more of the following positions: 
positions 936, 1012, 1 147, 1343 and 1476 in theS'UTR regionof the P2X7 gene as defined by 
the position in SEQ ID NO: 1; \ - 

positions 253, 488, 489, 760, 835, 853, 1068, 1096, 1315, 1324, 1405, 1448, 1494, 1513, 
1 628 and 1772 in the coding region of the P2X7 gene as defined by the position in SEQ ID 
NO: 2; and 

positions 4780, 4845, 4849, 5021, 5554, 5579, 5535, 5845 and 691 1 in the intron region of the 
P2X7 gene as defined by the position in SEQ ID NO: 3; and 

positions 76 ,155, 245, 270, 276, 348, 357, 430, 433, 460,490 and 496 in the P2X7 
polypeptide as defined by the position in SEQ ID NO: 4; 

and determining the status of the human by reference to polymorphism in P2X7 ; and ' 




Preferably determination of the status of the human is clinically useail. Examples of 
clinical usefulness include deciding which drug or drugs to administer and/or in deciding on 
the effective amount of the drug or dmgs. The term "drug acting at P2X7" means that drug 



ii) administering an effective amount of the drug. 
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5 binding with F2X^ in humans is an important part of a drug exerting its pharmceutical effect 
in man. 

According to another aspect of the present invention there is provided use of a drug 
acting at P2X7 in preparation of a medicament for treating a disease in a human diagnosed as 
having a polymorphism therein, preferably at one or more of the positions defined herein. 
1 0 According to another aspect of the present invention there is provided a 

pharmaceutical pack comprising P2X7 drug and instmctions for administration of the drug to 
humans diagnostically tested for a polymorphism therein, preferably at one or more of the 
positions defined herein. 



15 of human P2X7 polypeptide comprising at least one of the following: 

a alanine at position 76 of SEQ ID NO'4; 

a tyrosine at position 155 of SEQ ID NO 4; 

a glycine at position 245 of SEQ ID ^NO 4; 

a histidine at position 270 of SEQ ID NO 4; 
20 a histidine at position 276 of SEQ ID NO 4; 

a tyrosine at position 348 of SEQ ID NO 4; 

a serine at position 357 of SEQ ID NO 4; 

a arginine at position 430 of SEQ ID NO 4; 

a valine at position 433 of SEQ ID NO 4; 
25 a arginine at position 460 of SEQ ID NO 4; 

a glycine at position 490 of SEQ ID NO 4; and 

a glutamic acid at position 496 of SEQ ID NO 4; 

or a fragment thereof comprising at least 1 0 amino acids provided that the fragment comprises 
at least one allelic variant. 
30 Fragments of polypeptide are at least 10 amino acids, more preferably at least 15 

amino acids, more preferably at least 20 amino acids. 



According to another aspect of the present invendon there is provided an allelic variant 
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According to another aspect of the present invention there is provided an antibody 
specific for an allelic variant of human P2X7 polypeptide as described herein. 

Antibodies can be prepared using any suitable method. For example, purified 
polypeptide may be utilized to prepare specific antibodies. The term * Wibodies" is meant to 
5 include polycional antibodies, monoclonal antibodies, and the various types of antibody 
constructs such as for example F(ab')2, Fab and single chain Fv. Antibodies are defined to be 
specifically binding if they bind the allelic variant of P2X7 with a of greater than or equal 
to about 10^ M"'. Affinity of binding can be determined using conventional techniques, for 
example those described by Scatchard et al., Ann. M. 7. Acad ScL, 51:660 (1949). 
10 Polyclonal antibodies can be readily generated ft-om a variety of sources, for example, 

horses, cows, goats, sheep, dogs, chickens, rabbits, mice or rats, using procedures that are 
well-known in the art. In general, antigen is administered to the host animal typically through 
parenteral injecfion. The immunogenicity of antigen may be enhanced through the use of an 
adjuvant, for example, Freund's complete or incomplete adjuvant. Following booster 
15 immunizations, small samples of serum are collected and tested for reactivity to antigen. 
Examples ofvarious assays useful for such determination include those described in: 
Antibodies: A Laboratory Manual, Harlow and Lane (eds.), Cold Spring Harbor Laboratory 
Press, 1 988; as well as procedures such as countercurrent immuno-electrophoresis (CIEP), 
radioimmunoassay, radioimmunopreclpitation, enzyme-linked immuno-sorbent assays 
20 (ELISA), dot blot assays, and sandwich assays, see U.S. Patent Nos, 4,376,1 10 and 4,486,530, 
Monoclonal antibodies may be readily prepared using well-known procedures, see for 
example, the procedures described in U.S. Patent Nos. RE 32,0 11, 4,902,614, 4,543,439 and 
4,41 1,993 ; Monoclonal Antibodies, Hybridomas:y4 A^ew Dimension in Biological Analyses, 
Plenum Press, Kennett, McKeam, and Bechtol (eds.), (1980). 
25 The monoclonal antibodies of the invention can be produced using alternative 

techniques, such as those described by Alting-Mees et ah, "Monoclonal Antibody Expression 
Libraries: A Rapid Alternative to Hyhvidomas"\ Strategies in Molecular Biology 2,- 1-9 (1990) 
which is incorporated herein by reference. Similarly, binding partners can be constructed 
using recombinant DNA techniques to incorporate the variable regions of a gene that encodes 
30 a specific binding antibody. Such a technique is described in Larrick et ^\.\ Biotechnology, 7: 
394 (1989). 
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Once isolated and purified, the antibodies may be used to detect the presence of 
antigen in a sample using established assay protocols, see for example Practical Guide to 
ELiSA'* by D. M. Kemeny, Pergamon Press, Oxford, England. 

According to another aspect of the invention there is provided a diagnostic kit 
5 comprising an antibody of the invention. 

According to another aspect of the present invention there is provided a polynucleotide 



comprising any one of the following twenty six P2X7 haplotypes: 
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According to another aspect of the present invention there is provided a human P2X7 
polypeptide comprising one of the following eighteen combinations of alleleic variant 
determined amino acids based on positions identified in SEQ ID NO: 4: 
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According to another aspect of the present invention there is provided a polynucleotide 
which encodes any human P2X/polypeptide combination of allelic variants defined herein. 

According to another aspect of the present invention there is provided a method for the 
diagnosis of asthma or a predisposition thereto in a human, which method comprises 
determining the sequence of the human at one or more of the following positions: 
position 15 13 in the coding region of the ?1X-, gene as defined by the position in SEQ ID NO: 
2; and 

position 496 in the P2X7 polypeptide as defined by the position in SEQ ID NO: 4; 
wherein presence of A at position 15 13 or Glu at position 496 is diagnostic of asthma or a 
predisposition thereto. 

The invention will now be illustrated but not limited by reference to the following 
Examples, Ail temperatures are in degrees Celsius. * 
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In the Examples below, unless otherwise stated, the following methodology and 
materials have been applied. 

AMPLITAQ™ ^available from Perkin-Elmer Cetus, is used as the source of 
thermostable DNA polymerase. 
5 General molecular biology procedures can be followed from any of the methods 

described in "Molecular Cloning - A Laboratory Manual" Second Edition, Sambrook, Fritsch 
and Maniatis (Cold Spring Harbor Laboratory, 1989). 

Electropherograms were obtained in a standard manner: data was collected by ABI377 
data collection software and the wave form generated by ABI Prism sequencing analysis 
10 (2.1.2). 
Example 1 

Identification of Polymorphisms 
1. Methods 

DNA Preparation 

15 DNA was prepared from frozen blood samples collected in EDTA following protocol I 

(Molecular Cloning: A Laboratory Manual, p392, Sambrook, Fritsch and Maniatis, 2"^ 

Edition, Cold Spring Harbor Press, 1989) with the following modifications. The thawed 

blood was diluted in an equal volume of standard saline citrate instead of phosphate buffered 

■ ' I ■ 

saline to remove lysed red blood cells. Samples were extracted with phenol, then 

20 phenol/chloroform and then chloroform rather than with three phenol extractions. The DNA 

was dissolved in deionised water 

Template Preparation 

Templates were prepared by PGR using the oligonucleotide primers and an^^ 
temperatures set out below. The extension temperature was 72'' and denaturation temperature 
25 94°. Generally 50 ng of genomic DNA was used in each reaction and subjected to 35 cycles 
of PCR. Where described below, the primary fragment was diluted 1/100 and two microhtres 
were used as template for ampli fication of secondary fragments. PCR was performed in two 
stages (primary fragment then secondary fragment) to ensure specific amplification of the 
desired target sequence. 



30 Polymorphisms in P2X7 



Region 


Size 


Polymorphism 


protein 


frequency 








change 
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exon 1 



146bp 



936 C->A 
1012 T->C 
1147 A-^G 
1343 G->A 
1476 A->G 



3/56 
42/56 
3/56 
2/52 
35/52 



exon 2 



mtron B 



exon 3 



incron C 



exon 4 



mtron D 



exon 5 



mtron E 



exon 6 



mtron F 



168bp 



l.lkb 



68bp 



4 . 7lcb 



253 T-->C 



val76ala 



73 bp 



l.SJcb 



95bp 



2.8k;b 



488 G-^A 
4 89 C— >T 



2/54 



silent 
hislSS tyr 



80 bp 



617bp 



4780 C->T 
4 84 5 C^T 
4 84 9 A^C 



5021 T->C 

5 5 54 (GTTT)n=3, 4 

5579 G->C 

5535 A-->T 



2/54 
17/38 



39/52 
39/52 
28/36 



1/34 
n=3 , 14/40 
26/40 

1/44 



12 9 bp 



760 T->G 



val245gly 



1/40 



intron G 



1 .3kb 



5845 C->T 
6911 T->C 



2/40 
33/50 



intron H 



exon 



intron X 



exon 10 



intron J 



exon 11 



intron K 



exon 12 



136bp 



835 G->A 
8 53 G->A 



arg270his 
arg276his 



16/52 
1/54 



91bp 



1 ,7kb 



64bp 



84bp 



149bp 



1068 G->A 
1096 C->G 



lOlbp 



1315 C->G 



ala348tyr 
thr357ser 



18/62 
5/66 



pro4 3 0arg. 
splice site 



4/66 
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Positions in the 5 ' UTR refer to SEQ ID NO: 1 . 
Positions in exons refer to SEQ ID NO: 2. 
Positions in introns refer to SEQ ID NO: 3, 
5 Positions in protein refer to SEQ ID NO: 4. 
Example 2 

Haplotype analysis 

a) The following allele frequencies were determined in a Swedish population 



SEQ ID NO 


Position 


Frequency 
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b) Haplotype data. 

Analysis of 15 Swedish families with at least one asthmatic child using primer 
extension (SNapShot^M, Perkin Elmer) genotyping and GeneHunterTM analysis demonstrated 
the following haplotypes: 



100203 GB-2 



mi 



-20- 





1012 


489 


S579 


835 


853 


1068 


1096 


140S 


1S13 


Frequency 
n/S8 


1 


T 


T 


C 


G 


G 


A 


G 


A 


A 


1 


2 


C 


C 


G 


G 


G 


G 


C 


A 


A 


3 


3 


c 


c 


C 


A 


G 


G 


C 


A 


C 


1 


4 


c 


T 


G 


G 


G 


A 


C 


G 


A 


5 


5 


c 


C 


G 


G 


G 


A 


G 


A 


A 


1 


6 


c 


C 


C 


A 


G 


G 


C 


A 


A 


8 


7 


T 


T 


G 


G 


G 


A 


c 


G 


A 


1 


8 


c 


T 


C 


G 


G 


G 


c 


A 


A 


3 


9 


c 


, C 


C 


G 


G 


A 


c 


A 


A 


3 


10 




T 


G 


G 


G 


G 


c 


A 


C 


2 


11 


T 


C 


G 


G 


G 


A 


c 


A 




2 


12 


C 


T 


C 


G 


G 


G 


c 


A 


G 


3 


13 


T 


C 


C 


G 


G 


A 


c 


A 


A 


4 


14 


T 


C 


C 


G 


G 


G 


c 


' A 


G 


1 


15 


C 


T 


C 


G 




A 


c 


A 


A 


2 


16 


T 


T 


C 


G 


G 


A 


c 


G 


A 


1 


17 


C 


C 


G 


G 


G 


A 


c 


G 


A 


2 


13 


T 


G 


G 


A 


A 


G 


c 


. A 


A 


2 


19 


C 


c 


C 


G 


G 


G 




A 


A 


1 


20 


T 


,C 


C 


G 


G 


G 


G 


A 


A ■ 


1 


21 


e 




C 


A 


G 


G 


C 


A 


A 


4 


22 


c 


C 


C 


G 


. k. 


G 


C 


A 


c 


3 


23 


c 


■ T 


,G 


G 


A 


A 


G 


G 


A 


1 


24 




T 


G 


G 


G 


A 


G 


G 


A 


1 


25 


c 


T 


C 


G 




G 




A 


A 


1 


26 


c 


C 


C 


G 


G 




C 


A 


A 


1 



This results in the following proteins; 
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c) Analysis 



Transmissioa disequilibrium testing (TDT) of 15 Swedish families with at least one 
asthmatic child showed a significant excess of transmission of the A allele at position 1513 
(giving Glu at position 496 of SEQ ID NO 4 protein) to offspring with asthma, p=O.0l25. 

Ben J. Gu, Weiyi Zhang, Rebecca A, Worthington, Ronald Sluyter, Phuong Dao-Ung, 
Steven Petrou, Julian A. Barden, and James S, Wiley, J. Biol. Ghem. (2001) 276: 1 1 135- 
1 1142 reported that Ala at 496 (C at 1513) leads to loss of function in P2X7. Only one 
polymorphism was reported since they only analysed the final exon for SNPs. Loss of P2X7 
receptor function would possibly be protective against astlima and is compatible with our 
TDT data. 
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Claims 

1 A method for the diagnosis of a polymorphism in P2X7 in a human, which method 
comprises determining the sequence of the human at one or more of the following positions: 
positions 936, 1012, 1 147, 1343 and 1476 in the 5'UTR region of the P2X7 gene as defined by 
the position in SEQ ID NO: 1; 

positions 253, 488, 489, 760, 835, 853, 1068, 1096, 1315, 1324, 1405, 1448, 1494, 1513, 
1628 and 1772 in the coding region of the P2X7 gene as defined by the position in SEQ ID 
NO: 2; and 

positions 4780, 4845, 4849, 5021, 5554, 5579, 5535, 5845 and 691 1 in the intron region of the 
P2X7 gene as defined by the position in SEQ ID NO: 3 ; 

positions 76 , 1 55, 245, 270, 276, 348, 357, 430, 433, 460, 490 and 496 in the P2X, 

polypeptide as defined by the position in SEQ ID NO: 4; 

and detennining the status of the human by reference to polymorphism in P2X7. 

2 Use of a diagnostic method as defined in claim 1 to assess the pharmacogenetics of a 
drug acting at P2X7. 

3 A polynucleotide comprising at least 20 bases of the human P2X7 gene and comprising 
an allelic variant selected from any one of the following: 



Region 


Variant 

SEQ ID NO: 1 


5 ' UTR 


936 A 




1012 C 




1147 G 




1343 A 




1476 G 



Region 


Variant 




SEQ ID KO: 2 


exon 2 


253 C 


exon 5 


488 A 




489 T 


exon 7 


760 G 


exon 8 


835 A 




853 A 


exon 11 


1068 A 




1096 G 


exon 12. 


1315 G 
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exon 13 


1324 T 




1405 G 




1448 T 




1494 G 




1513 C 




1628 T 




1772 A 




Region 


Variant 




SEQ IP NO: 3 


intron E 


4780 T 




4845 T 




4849 C 


intron F 


S021 C 




5554 (GTTT)^,n=4 




5S79 C . 




5535 T 


intron G 


5845 T 




6911 C 



4 A nucleotide primer which can detect a polymorphism as defined in claim L 

5 An allele specific primer capable of detecting a P2X7 gene polymorphism as defined in 
claim 1. 

5 6 An allele-speGific oligonucleotide probe capable of detecting a P2X7 gene | 
polymorphism as defined in claim 1. 

7 Use of a P2X7 gene polymorphism as defined in claim 1 as a genetic marker in a 
linkage study. 

8 A method of treating a human in need of treatment with a dmg acting at P2X7 in 
0 which the method comprises: 

i) diagnosis of a polymorphism in P2X7 in the human, which diagnosis preferably 
comprises determining the sequence at one or more of the following positions: 
positions 936, 1012, 1 147, 1343 and 1476 in the 5'UTR region of the P2X7 gene as defined by 
the position in SEQ ID NO: 1 ; 
5 positions 253, 488, 489, 760, 835, 853, 1068, 1096, 1315, 1324, 1405, 1448, 1494, 1513, 
1628 and 1772 in the coding region of the P2X7 gene as defined by the position in SEQ ID 
NO: 2; and 
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positions 4780, 4845, 4849, 5021, 5554, 5579, 5535, 5845 and 6911 in the intron region of the 
P2X7 gene as defined by the position in SEQ ID NO: 3; and 
positions 76 ,155, 245, 270, 276, 348, 357, 430, 433, 460, 490 and 496 in the P2X7 
polypeptide as defined by the position in SEQ ID NO: 4; 
5 and determining the status of the human by reference to polymorphism in ?2Xy ; and 
ii) administering an effective aiiiount of the drug, 

9 An allelic variant of human polypeptide comprising at least one of the following: 

a alanine at position 76 of SEQ ID NO 4; 

a tyrosine at position 155 of SEQ ID NO 4; 
10 a glycine at position 245 of SEQ ID NO 4; 

a histidine at position 270 of SEQ ID NO 4; 

a histidine at position 276 of SEQ ID NO 4; 

a tyrosine at position 348 of SEQ ID NO 4; 

a serine at position 357 of SEQ ID NO 4; 
15 a arginine at position 430 of SEQ ID NO 4; 

a valine at position 433 of SEQ ID NO 4; 

a arginine at position 460 of SEQ ID NO 4; 

a glycine at position 490 of SEQ ID NO 4; and 

a glutamic acid at position 496 of SEQ ID NO 4; 
20 or a fragment thereof comprising at least 10 amino acids provided that the fragment comprises 
at least one allelic variant. 

10 An antibody specific for an allelic variant of human P2X, polypeptide as defined in 
claim 9, 

11. A Dolvnucleotide comnrisinff anvnnfinffhp fi^ii/^^r^tnrr Hx/^tn^^, f.;^ nov 
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12 A human P2X7 polypeptide comprising one of the following eighteen combinations of 
alleleic variant determined amino acids based on positions identified in SEQ ID NO: 4: 
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1 3 A polynucleotide which encodes any human P2X, polypeptide as defined in claim 1 2. 

1 4 A method for the diagnosis of asthma or a predisposition thereto in a human, which 
method comprises determining the sequence of the human at one or more of the following 
positions: 

5 position 15 1 3 in the coding region of the P2X7 gene as defined by the position in SEQ ID NO: 
2; and - 

position 496 in the P2X7 polypeptide as defined by the position in SEQ ID NO: 4; 
wherein presence of A at position 15 13 or Glu at position 496 is diagnostic of asthma or a 
predisposition thereto. 
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TITLE:C hemical Compounds 



5 This invention relates to polymorphisms in the human P2X7 gene and corresponding 

novel allelic polypeptides encoded thereby. The invention also relates to methods and 
materials for analysing allelic variation in the PIX^ gene, and to the use of P2X7 
polymorphism in treatment of diseases with P2X7 drugs. 
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SEQUENCH MSTXNG 

<110> AstraZeneca AB 

<120> Chemical Compounds 

<130> morten 

<140> 
<141> 

<160> 4 

<170> Patentin Ver. 2.1 

<210> 1 

<211> 4900 

<212> DNA 

<213> Homo sapiens 

<400> 1 

gactcactat agggagaccg gcagatctga tatcatcgcc actgtggatc cgaattctag 60 

aaggcctatg ttctaagcat caggctttac ctgtgaatct cctcttttta cagatgaaga 12 0 

tgactgtatc actcagattc Gcggcaggaa agcaatggca tactcaagtg gggtaactaa 180 

tgatggaacc atttacaaag gtgtggacag agttaagaaa aagGaatagg agatagcgag 24 0 

cttcctgggg ctggtaagag tggggagccc ttacGactcc caggactaaa ggagggagtg 300 

gtgecGagaa gccctgccta tatgcaactg agaagggcag ggccagggag tcacgtccat 360 

cctcactgct ctcGagtctc ctgaactgga agccagaagg tgaggggaac cctgatgcag 420 

tttgtatgtg tgagaaagta caattagttt agactgaaaa actgaaaatG taccGggcGa 480 

cttagcaggo tggaataaca gaaatggatc aagccagctg taaagataac agggaacaat 54 0 

aattctctgt agctgtaaag tgataataca accctgcatc tttgagtgac tgctgaaaca 600 

ttgtGGttta aaatcagaga ccttcagaaa cttcgqtgtt tgaaattaca tgactaagac 66 0 

tgaaatattc caattttgcc cggaagattt aagtcatctt gacacagaga agcagcctca 720 

atGCacaact caggagcaga gcttcagata aagattttce ggacaGattt gacatgtatc 780 

ttagctatgt tgcttcccag gaaacagggc cctgggtcct ctttgcaatc cagaotgaag 840 

ttgactgctt tgtacaaacc tgttttgctt tgagtccafcc aaaacatgac ttcatttaga 900 

ttttatctca actGcacttt cctcggaatc ctatactaaa ttgctgtttt cctttgtttg 960 

gtgatgcgcg tagctcttct ggtgggtggt gtccctcact gaataggtca ataaaoctaa 1020 

ccttgttgga ctgccactgt gtccctggtg atctttggct gattggtcta ggtcatagat 1080 

cgacctgccg gggtgcagag gagggtggag agtaactGag agggtGaagc atgaaagatG 1140 

tggcagaaaa ataaagcccc tccaccccca ccacccctac Gcctgcaaat ctgatttccc 1200 

ccaccaactg cagaccagag tatcataagg ggcggtggaa gaagaggggg agatcttcat 1260 
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ttacccagag ctcctataca tcaggggctg aataaagggt tgtagaaatg aatgaatcaa 1320 
tctctgagtg gggcttcagg cagtggaaag atctcagtcc ttttctgagg cataatggaa X380 
gctcccagcc ttgtgacatt tgcaaggctg cccctttctc ccaagagaca tgagaccaaa 1440 
aaagtgaaag gaaagggggg aaaagggaga attctaaaaa tgcccatcct ctgaacacca 1500 
tctttgtgta ggcatctggg ggaggccagc tggggtgagg tcatctgcca gccaggcccg 1560 
taggacttgg cgcttcttgt ttafccacagc cacatgtggg gccactgcca gggcccgccc 1620 
caactctgca gtcattggag gagcttgaag ttaaagactc ctgctaaaaa ccagtacgtt 1680 
tcattttgca gttactggga gggggcttgc tgtggccctg tcaggaagag tagagctctg 1740 
gtccagctcG gcgcagggag ggaggctgtc accatgccgg cctgctgcag ctgcagtgat 18 00 
gttttccagt atgagacgaa caaagtcact cggatccaga gcatgaatta tggcaccatt 1860 
aagtggttct tccacgtgat catcttttcc tacgtttggt aagtgggatc tggggaggac 1920 
ccagatctct gcagtggccg acagcacaga aagccccagc gggcagcttc aggtgcacat 1980 
tctgaatctc acatggtttt cgaatctgag. aGgtgctctc acagccagct gggcgggagg 2040 
gaggaagcag cagcaggcaa gaggaaacgg tgccaggctg cagcagagag aagccacagg 2100 
acaagcggga ttcctttctg ctctacttca ggcccgccag ggcgcgcaag gcagggcgtg 2160 
cctggggaag gtaggaaagc gcagggcaac accctggatc cccagggagg aggcgaggat 2220 
ctcagggcac gcctggtgat catgctggca tctgagtcac catgcttggg aggaatagga 2280 
ccaggcttga aaatgcgt-ta taactttagg tcctcaccaa cgtcaggaag gccctgcttt 2340 
ttggtttttg tttcttctaa aagaaactta ccgagatata atttatacac catacaattg 24O0 
acccatttaa agggtaccat ttaatgattt; tcagattatt cccagagttg tgcaaccatc 2460 
cccacaatca attttagaat attttaatcg actcaaaagg aatcccacac tccttcacca 2520 
tcatttccaa cactgttcct CGtccttccc aCGcatcaac ttcctttctg cctctac^ga 2580 
tttgccgatt ctggacattt catataaatg gaatcacata atatgtggtc ctttgtggca 2640 
cttagcgtgt tttcaatgct catccatgtt gtagcatgtg ttgatacttc attcaatttt 2700 
tttttttaaa gagacggggt ctcactattt tgtCGaggct ggtctcaaac tcctggactc 2760 
aagtgatctg cctgcctcgg cctcccaaag tgtcaggatt acaggcgtga gccattgcac 2820 
ccggctgata cttcattGct ttttacggct gagtagtact ccattgcatg gatagaccac 2 8 80 
ttttttctat ccattcatGG attgatggac attggggttg tttctctttt ttggctatca 2940 
tgaataatgc cacatgaaca tttgtgtaca aggttttatg tggatatata ttctcctttc 3000 
tctcgaatat gtacctaaga gtaaaaattg ctaggtcata tgttaactat gtttcacctt 3060 
tggggggaat gtggagctga atttcacagc agctgcagtt ttttacattc ctatcagcag 3120 
agtatgaggg atccaatttc tccacatcct cacGaacgct tgttatcgtc tgtctttttg 3180 
ggttttgttg ttgtcatttt gtttttgtct ttgagatgaa gtcttgctct gttgcccagg 3240 
ctggagtgca gtggcgcaat cttggctcac tgcaacctcc aGctccccgg ttcaagcgat 3300 
tGtcctgcct cagcctcatg agtagctggg attacaggtg tgcgtcacca ctcctcacta 33 60 
atttttgtat ttttagtaga gatgaggttt cgccatgtag gccaggctgg tctcaaactc 3420 
ctgacctcaa gtgatcGgcc caccttggcc tcccaaagtg ctgggattacagg cat gage 3480 
cgccacgccc ggctgattgt ctgtcttttt tattatagcc atgctagtgg gtgtgaagtg 354 0 
gtagttcatt gtggttgtga tttgaatttc cctgatggtg agtgcctctt attctctgtg 3600 
ctgattgata a.tgatgatga aggcaatttg tatctataga gtggcagtgt agtttactaa 3660 
gagttagggt aacttatttc atagtactgg ctatgtcttc tgggccaagt cattaacttc 3720 
tctgagcctc agtttctgca cctgttcata gggttgtggc aattaaccaa aaaaaaaagg 3780 
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)>^\ catgaacagc ccttatcatg atgactgaca taggataaga gcfcccataac tagtatctat 3840 

^'/ ■J-j ttttaaaaat aatcttttta agtctgggag tggtggctca cacctgtaat cccaacactt 3900 

tgggaggccg aggcgggtgg atcacgaagt caggagtttg agaccagcct ggccaatatg 3960 
gtgaaacccc atctctacta aaaatacaaa aattagtggg gagtggtggt gcacacctgt 4020 
5 aatcccagct actagggagg ctgaggcagg agaatcgctt gaacccggag gcggaggttg 4080 
cagtgagccg agatcaagcc actgcactcc agcctgggtg acagagcaag actccatctc 4140 
aaaataataa taatagtaat aactttfcttg attatataat agtatatatg tatataaaat 4200 
acatgtatgt atttttatct atatcctctg ctctgaccct caaagtaaec acgtccaagt 4260 
tcaggatttg aaatctggaa acgtggattc aaaaatcctt cacctctttg agccttggtt 4320 
10 tcatcatctg taaaatgggg agaattgttg ataggaatat taaatgaact aataaatgca 43 8 0 
aagctgtttg agaaatatat ggcatatagt aatccctgat taagtgttag ttcttattat 4440 
taataatgct attattagga ttattattat tcgattcata tgtttactgt tcaacaaata 4500 
ttgaatgata aacatatatg ctgggtccgg catggtggcc catgcctgta attccagcac 4560 
tttgggaggc caaggcgggc aggtcacttg aggtcaagag tttgagacca gcctggccaa 4620 
15 tgtggtggaa actccatctg tgctaaaaat acaaaaatta gccgggcatg gtggtgggtg 46 80 
cctgtaatcc cagctactcg ggaggctgag acaggagaat cacttgaacc caggaggtgg 4740 
; aggttgcagt gagccaagat tgcaccactg cactccagGc tgagccacag agcaagactc 4800 

tgtctcaaaa aaaaaaaaaa aaaatatata tatatatata tatatafcata gtatttttag 4860 
tagagatggg gttttgccat ctcttatata tttttatatt 4900 

' 20 

<210> 2 ■ . ■ 
<211> 1853 

<212> DMA ; 
25 <213 > Homo sapiens 

. ■■ <4oo> 2 ^- 

aaaaegcagg gagggaggcc gtcaccatgc cggcctgctg cagctgcagt gatgttttcc 60 

agtatgagac gaacaaagtc actcggatcc agageatgaa ttatggcacc attaagtggt 120 

30 tcttccacgt gatcatcttt tCGtacgttt gctttgctct ggtgagtgac aagctgtaec 180 

agcggaaaga gcctgtcatc agttGtgtgc acaccaaggt gaaggggata gcagaggtga 240 

aagaggagat cgtggagaat ggagtgaaga agttggtgGa cagtgtcttt gacaca^cag 300 

actacacGtt ccGtttgcag gggaactcct tcttcgtgat gacaaacttt ctcaaaacag 360 

aaggccaaga gcagoggctg tgtcccgagt ktcccaGccg caggacgctc tgtccctctg 420 

35 accgaggttg taaaaaggga tggatggacc cgcagagcaa aggaattcag accggaaggt 480 

gtgtagtgca tgaagggaac cagaagacct gtgaagtctc tgcctggtgG ccGafecgagg 540 

cagtggaaga ggocGGCcgg cctgctctct tgaacagtgc Ggaaaacttc accgtgctca 600 

tGaagaacaa tatcgacctc cGcggccaca actacaccac gagaaacatc ctgccaggtt 660 

taaacatcac ctgtaccttc caGaagactc agaatccaca gcgtcccatt ttccgactag 720 

40 gagacatctt ccgagaaaca ggGgataatt ttfccagatgc ggcaattcag ggcggaataa 780 

i' tgggcattga gatctactgg gactgcaacG tagaccgtcg gttccatcac tgGGgcccca 840 

aataGagt;tt ccgtcgcctt gacgacaaga ccaccaacgt gtccttgtac Gctggctaca 900 



.4- 

acttcagata cgccaagtac tacaaggaaa acaatgttga gaaacggact ctgataaaag seo 
tcttcgggat ccgttttgac atcctggttt ttggcaccgg aggaaaattt gacattatcc 1020 
agctggttgt gtacatcggc tcaaccctct cctacttcgg tctggccgct gtgttcatcg 1080 
acttcctcat cgacacttac tccagtaact gccgtcgctc ccatatttat ccctggtgca 1140 
agtgctgtca gccctgtgtg gtcaacgaat actactacag gaagaagtgc gagtccattg 1200 
tggagccaaa gccgacatta aagtatgtgt cctttgtgga tgaatcccac attaggatgg 1260 
tgaaccagca gctactaggg agaagtctgc aagatgtcaa gggccaagaa gtcccaagac 1320 
ctgcgatgga cttcacagat ttgtccaggc tgccectggc cctccatgac acacccccga 1380 
ttcGtggaca accagaggag afcacagctgc ttagaaagga ggcgactcct agatccaggg 1440 
atagccccgt etggtgccag tgtggaagct gcctcccatc tcaactccct gagagccaca 1500 
ggtgcctgga ggagctgtgc tgccggaaaa agccgggggc ctgcatcacc acctcagagc 1560 
tgttcaggaa gctggtcctg tccagacacg tcctgcagtt cctcctgctc taccaggagc 1620 
CGttgctggc gctggatgtg gattccacca acagccggct: gcggcactgt gcctacaggt 1680 
gctacgGcac ctggcgcttc ggctcccagg acatggctga ctttgccacc ctgcccagct 174.0 
gctgccgctg gaggatccgg aaagagtttc cgaagagtga agggcagtac agtggcttca 180O 
agagtcctta ctgaagccag gcaccgfeggc tcacgtctgt aatcccacct ttt 1853 

<210>;.3. 
<211> 11266 
<212> DNA 

<213> Homo sapiens 
<400> 3 

Gaccacctac aaaggaaacc ccaaatGcag tagcagtcac tcc|cat:tet ccGGttcccc 60 
tgtccctggG cacagtctac tttGtgtctc tatagatgcc tattotggac atttcotata 120 
aatagaattg catatggtgt ggccttttgt: gtctgtGttc tttcactGag catcatgttc 180 
tccaggccca tccatgttgt agcctgtgcc attgcttcat ccttcttatg gctaaataag 240 
attGtgtgta tgaacgtacc acattttact cgtccattca tccgtcagtg gcoacttgca 300 
tggtttccaG ttttttggcg attctgagta gtgctgctat aagcattcgt gtgGacattc 360 
tggtggatat cgaatcaott: ctccacatct tagtaacaca cgtcacttac tccccactct 420 
gtoatccttc tatctgcagt atqccaGccg caggacgctc tgttcctctg aoogaggttg 480 
taaaaaggga tggatggacc cgcagagcaa aggtaccttc tgtttctttt cGcgagaccc 54 0 
taggggtgga tggtGtggca tcttggtgac atctgtgatg GGcaggtcag gtotccagcc 600 
totgotctca gctgccGtct CccaccatGa coaagccata ggcgagtctg Gccatgcttc 660 
ggctctgtGG ccagcagacc agctgctgac tgtaaacatg actccagttt tccagtgaga 720 
gaagaagctc Gtaaaaacct agcaggttca ggattctaat oggtagaaaa ctGaGaCggc 780 
Gtatagcatc atctgagtat tctaaacttt ccccctgaat ttcctcaaag gttgaggacc 840 
atgaactttt accoGcaggg gaacGtggca gcaataccca tattaacctg Gagaattttt 900 
tttgtctttt attttatttt attttttaaa catcttttgc aGtgttttat tttgattttg 960 
attctgattt tatttataCQ* taagtgcagt gctatcgcga tactgcagaa tttetttatc 1020 
tcacatttta acttaaaaag gcacagggca gcgagcgcag aggctggtgc GCgtaatccc 1080 
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agcactttgg g^gggtgagg cagatggatg 
aaaacatggt gaaaccccgt ctctactaaa 
acgcttataa tcccagctac ttgggaggct 
cagaggttgc agtgagccaa gatcatgcca 
cccctttaaa aaaaaaaaaa aaaggcacag 
aaaaaaaaaa tcagtggtta taatgcaaac 
attctactcc atactgtatg tgtatttgag 
ctatttaatt aattctccct tggaaactga 
ccatgctgat ttcatggctt tgcacatcct 
tgcccctttg agaggccatg ttctagtcca 
cgctcccctc atgggcacct tttcttataa 
ctttctgtgt tctctctgat ctttattatg 
tgtggttagg atgggcttga tggaagctga 
cctagtctct cgGCcgggtt gagttaatga 
ttctttcaca tctgtggttc tacgatgctt 
gtgtagtgca tgaagggaac cagaagacct 
cagtggaaga ggccccccgg tgagtcgcat 
gaccagatga ggccttgccg aggctgcttg 
gtcctgggtc ctaccggctt ggggacccct 
agtatctctg ggagggccat ggtggttggt 
taaatagcta Gtaccctgag catccttggg 
tccagggtac ccccagaccc cctcctgtag 
gtgtccatac tgagtgatca aggataggaa 
gcgsiggsi^^ig agaaggggaa ggggaggaaa 
ggaaaggaag ttttctcaaa ttaaatgctt 
tattgatgct tcgcttcaat acacaaagtc 
tgatgtattt cagcagagaa tagttagaaa 
attttgctgt gtgtccttgg gaaaatggct 
gtaaaagcag aataataatt tcaccaactt 
gatacttgta cagttattgt aaggtaagcc 
cacacacaca cacacacacg cacacacaca 
tggttctaga ccagcactgt ccaattgaac 
tgctgtccaa tagggcagct actaggtaca 
ctgaattttt gaaagagatg atagatgata 
tagataatag atagacagac aggtagatag 
agacagagtt ttgctatgtt gcccaggctg 
ctgccttggc ctcccaaagt gctggggtta 
ttttaattaa atttaaattt aaatagccac 
gcagttctag accgatgtga ttcaggatca 
ctgccccaag ctggcctgta gaaggctcag 
tgcgctcagc agcatcaccc cGtatgattc 
ttcccggtca ggggcatCtc tttgggaatc 



cttgaggtca ggggttcgag aacagcctgg 114 0 
aatacaaaaa tcagccagac atggtggcac 120 0 
gagacgtgag aatcacttga acctggaagg 1260 
ctgcactcca gcatgggtga cagagcgaga 13 2 0 
ggcaatttta aaaatactgc aaatagtaaa 1380 
acacacaaaa aggcatatgc ccattactgc 1440 
ttagtataaa agttatttta acattgctca 1500 
ttaatcatcc tggcactcca ggaagatgtg 1560 
gggcaggctg tgtacccctt gagggacttg 162 0 
tttatactaa gtgagagcat acacctgttc 1680 
agaaacaaaa gagccagGag aatccacagt 1740 
ttttgcttgt ttgcctcgcG ttgtgttcgt 1800 
agctgcgtgg gttggaaagc ctggtcaaag 1860 
tgtcGCtcct ggagaacgtc ctctctgcag 192 0. 
tgaGccctat aggaattGag accggaaggt 1980 
gtgaagtctG tgcctggtgc cccatcgagg 204 0 
gggg^gacag acacagtggc cctcagcggc 2100 
ggccttcGcc tctcagcaca gccctgcaaa 2160 
gcgctctgga tgcactgctt ggGacaaact 2220 
aaactgttgt aacactcctg taccaactgg 228 0 
tgtccctggG cccttecttc ccccagatct 2340 
tgccacagca ; ggatcccttc tgacttgtca 2 40 0 
ggaaggaggg agatggaaag gaaggacgaa 246 0 
aagcaaaagg ggtgagggta aaagaggggg 252 0 
acaatgacat acagatttgg cggtcccttg 2580 
acaatgttaa atctcagaag ccacaagggc 2640 
gacctggatt caattcctag ccctaacacc 2 700 
taacctctct gagtttcagt gtcctcacct 2760 
catagggctg ttgtaaggat taaatgagat 2820 
Gcatgcatgc ctggcttaca cacacacaca 2 88 0 
cacacacaca atctacccct agaagtgtgg 2 94 0 
ttgatgcagt gatggaaatt tctgtatGtg 3000 
tgtggctatt gagtacatga aatgcgacta 3060 
gatagaaaga tagatagata gataaataga 3120 
atagatagat agatagatag atagatagat 3180 
gttttgaact cctgggctca agcgatcctc 3 24 0 
caggtttgag ccattgctcc cagcCtgaat 3 300 
acatgtctag tggcCacGat attggacagc 3360 
ttCGCtcagc atcgtggggc aaagagaaaa 3420 
gcgaaggttt cccaatgccg ggatgggggg 34 80 
tcaatcgcta atagctccac tcaggc tGat 3 54 0 
acccagctct gggagataca gcagcctcca 3600 
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ctcaggtagt ccttgttcaa gacaagcggc ccttgactga ctgcagtttc agttccagct 3660 
iil^l; Gtgctatcaa ctcactcatt aaataaactg catctccagt gtgcctgcct ctgggctgga 3 720 

.Ay- ttttgacgtg acctgggcaa gcaactccct gaacttcagt ttctcatata ttatatgaat 3780 

tagctaagat ggttcgttta atcattcatt caacacatcc atcaccacgt agtaggtgtt 3840 
5 agatatttat ttcatacgta actacgcata agagactttg ctaagtttta ggtaaaatac 3900 
aagtcccaga tacggagcaa gtctcaacca ctgtacatac ctgaatgtgt aattacatca 3960 
ctgtaagagg tgccacagta aatgccactg ggtcttgtgt tagtccattc tcacacaaag 4020 
aactacctag ccaggtgcgg tggcteacgc ctgtaatccc aacactttgg caggctgagg 4O80 
caggcggatc acttgaggtc aggagtttga taccagcctg gcaaacatgg tgaaacccca 4140 

10 tctctattaa aaaatgcata aattagccag gtgtggtggc acacgcctgt aatcctggct 4200 
actcgggagg ctgaggcagg agagtcgctt gaacccggga gatggaggtt gcagtgaccc 4260 
gagatcgcgc cactgcactc cagcctgggt gacagagtga gactccatct cagaaaaaaa 4320 
taaaaataaa aataaagaac tacctgagac caaacacttt acgaaaaaaa agaggtttaa 4380 
ttgactcaca gctccacagg cttaacagga agcctcagga gactfcacaat: catggcagaa 44 40 

15 ggcgaagggg aagcaaacac atcttaccat gatggagcag gaggcgggtt tcgggggatg 4500 
. tgccgcacac ttttaaatga ccagatctcg tgagaactca ctcactatca cgagaacagc 4560 

aaggaggaag tGcgcccGca tgattcagtc acctcccacc aggcccctcc tctggcacat 4620 
ggggattaca attcaagatg agatttgggt ggggacacag agccaaacca tatcagatct 4680 
caagaaggga gaaattcttG ttggaggagc tggaggggct ttgtggagag tttGagaatg 47 40 

20 ctttgcGcac taggtttgct gtatccattt ctcttcatgc atcccaaaga ccaagccaag 48 00 
aaaccagaag cctctggtcc cactggccca tgggctccct cggtccccac cgtcactaat 4860 
ggccattttg catgtctctc CGGcaggcct gctGtcttga acagtgccga aaacttcact 4920 
gtgctcatca agaacaatat cgacttcccG ggccacaact acaccacgta agtgcccagg 4980 
ccgcctggcc gtcttagtta tctactgGtg agtaataaat taCCGGaaac ctcagaagcG 50 4 0 

25 tgaaacaaca aacgcceatt gtctcccacg gtttctgtgg gtcaggaatc cgggaatgac 51O0 

I ■ ■ 

ttdgctgcgt ggttGtggct caaggtctgt caggttgtag ccaagctgtc aaccagggct 5160 

gcagtcattt ctaggcttga ctggggctgg agaacecttt tccaagctct cacacagttg 5220 

ctcgtgggag agctcagttG Gtcaccacgt gaacctcgcc ctagaccact tgagtatcct 5280 

. tggtatatgg tggctggctt ctcccagagc aagtgaccca agagagaGag agcaagcaac 53 40 

30 caagagtata accaagatgg aagccacagt ctttgggggg agaccccaac acttGtgcca 5400 

; tatgccattg gtcacacaga tcaaccctgg fcccagtgtga gaggGcactg cccaggggtc 5460 

ccaggaggca gggatcattt ggggctttca tggaacctct ccaccacact ggctcactcG 5520 

^-rtgggaaagag acagatctgt Cttcaatcga gatgtttgtt tgtttgcttt taattatgga 5580 

;■ caggagaaac atcctgccag gtctaaacat cacttgtacc ttccacaaga ctcagaatcc 5640 

35 acagtgtccc attttccgac taggagacat cttccgagaa acaggcgata atttttcaga 5700 

tgtggcaatt caggttggtg gtgctttgta cactgggatg tggggctgtg tgtctaggga 5760 

tggaggatgt caaacagcca agaggccggg ccactgggtc ttcataatgt ggctcaGatt 5820 

tactgagcat ttagtaaatG cacccgctgG gGtaaggtct ttacctacgc tacGCcgtca 5880 

aatcccaaaa caatccttat gagtgagagc tacttggtgt attcctctcG tgcggctgct 5940 

40 gtagcaagtt atcaaagctt agcggcttea aacaacacat atttgcttat gttgccagag 60O0 

atcagaagtt ggagatgatt ttccctgagG cagggcggtg ctCGctctgg gactttaagg 6060 

gagaatccag ctcctcagGt. tttccacctt ctggagGtgc attccttgca tctctccaaa 6120 
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gccagcagca taacatcttg cctcagtggc 

ccctttgcct ctgtcttaaa aagagagaga 

actggataat ccaggataat ctcccatctc 

ctcatgcctg taatcccagc actttgggag 

5 agttcaacac cagcctggcc aacatggtga 

aaaaaaaaat agccgggcat gattgcgggc 

gacaggagaa tcgcttgaac ctgggaggca 

gcactccagc ctaggtgaca agagcgaaac 

atgtattcgc atctgcaaag agctttccct 

10 atatttgata gagtgccctg aattccagtc 

gcgtgggggg cattaaaggt gtttgggtac 

gacaatgagt ttaacagtga tgtgtcccag 

tgtggctgcG acaataacac ttgtgcgagt 

cagcattttt tctctaaaaa ccttaagcaa 

15 tgcgtaactc acacccagca gccatagaga 

aatgggcatt gagacctact gggactgcaa 

caaatacagt ttccgtcgcc ttgacgacaa 

caacttcagg CaactGcaag gcceaggtca 

ggaactggtg agactaattt tggtttccaa 

20 ctaagaacta ggtgataact gaattttttc 

cactctttat tttttactta tttttttttt 

gctgaagtgc agtggcgcca tctcagtcac 

ctctcGtgcc tcagcctccc aagtagctga 

acaaaactat Ggggaaacag aaaagcatag 

25 accctgaaat aatcaataac aaccctcggg 

atacagctcc tatgagatea tagcatatat 

ttttcatacc acagccactc aaaattctct 

cattttgtag gtgaacaaat aacaactgct 

ttttcttttc tttttttttt ttagatggag 

30 gtggcatgat ctcagctcac tgcaacctct 

cccaagtttG tgggaGcaca ggcatgtgcc 

tagagatggg gtttcactgt gttggcGagt 

ttcccacGtt ggcttcccaa agtgctagga 

aggagggttt tCGatatacc aagcactccc 

35 tggaaggaag atattgtttc tggaagatga 

gtttgcataa ttctaaccca cgttcactGt 

ttGCcatctc aatgattttc tcaccgtacg 

ccagtttttc tagtacaagt aaccacagtg 

aattccaggg ttacttccct aggataaatt 

40 cattttatca tacgcttctt atttttaaaa 

ttGtgagaca gagtctcact ctgtcaGCGa 

caGtgGaecc tctgcctcCG aggcccaagt 
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cactttcact ccctatcctg tgtccaatct 6180 
gcatttacaa gagggggcat ttaaggacca 624 0 
aagatccttc atttaggctg ggcacggtgg 6300 
gctgaggtgg gtggatcacc tgaggtcagg 6360 
aagcccatct ttactaaaaa tacaaaaaaa 6420 
tcctgtaatc ccagctactc gggaggctga 6480 
gaggttgcag tgagccgaga tcgcaccact 6 54 0 
tccatctcaa aaaaaaaaaa aaaatccttc 6600 
aggggagtac taggaggtaa agcagaaaag 6660 
taataagttt ggacttgatc tttaatgggg 6720 
^99sgtggtc tgttgaaagt tgtattttag 6780 
acgggggtag ggagagtgag gagatgcgat 6 84 0 
taggtggggc tgtacatatg gttcttcaat 6900 
tcctggctat gGagggagat gtctggcggt 6 960 
ctgtcccttg ttgatccttc agggcggaat 7020 
cctagaccgt tggttccatc actgcGgtcc 7 080 
gaccaccaac gtgtcGttgt accctggcta 7140 
aactcacGca gtggctgaat cgcattcGca 720O 
ggcaacaaga tgaatgaaaa aagactttct 7260 
cataattttt taaaattctc aaaagagata 7320 
tttgaaatgg agtGtcactc tgtcacccag 7380 
tgGaaacttc cgcctcccag gttcaagcga 744 0 
gattacaggc ggatgGaGac tgtttataaa 7500 
c^9999g^t:ca aaatcgccca taattcccct 7560 
ggaattttcc tcatctgtac caattatttc 7620 
atatatatat cttgtggtat tctgcagggt 7680 
gtaaccatca cattaatgat cataacattc 7740 
acaattcagg aagtgttttc ttttcttttc 7800 
tcacactctg Gttgcccagg ccggagtgca 7860 
gcctcctagg tGcaagcgat cctcccacct 7920 
acGacaccca gctaattttt gtatattcag 7980 
ctggtctcga actcttgacc tcaagtgatc 8 040 
ttacagtcat gagGcactgt gcctggccca BlOO 
catcgccatc cctaaatctc ccaacaacGc 8160 
tttgcccaag acGcacagct gatagtacat 8220 
gacccGacac tcacactGCC atcGcttccc 8280 
cctccatgaa ttgaatattt gagt'tgcttc 8340 
tgcatctttg caccgtaaaG ttcttctttg 8400 
tcctagactt attgaatcaa aggtcgtgaa 8 4 60 
atatctatgg ttataatgtt tcattttttt 8520 
ggttggagtg gaccgggtgc aattatagct 8 580 
gatcctccca cctcagcctc ctgagtagct 8640 
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aggactacag gtgcatgcca ccatgcccag ctaattttta aaattttttt gtagagtcgg S700 
ggtctcacta tattgcccag gctggtctca aactcctggc tcaagcaatc cgcctgcctt 8 760 
ggcctcctaa agcgttggga ttgcaggtgt gagctactgc acctggccta taattttcat 8820 
ttctaggatt tttatttggt gccttttcaa agtcatctat tattgttcca gtgagtccca 8880 
5 ttcttacctt aaggatccta ctccttctgt ccattctact gtatcattcc tttcataccg 8940 
acctattatc tgaagtaact tgggtgggag ttctcctcgt gggctttgaa atactgtctt 9000 
cagtagaaaa gatcttatgc aaagttctgc atgtgctgtg tgttatggag tattccttat 9060 
atattactca aagcagcttc atgaactggc aggcagcccc ttaaatgtgt ttctgtttct 9120 
gtttttttgt tttttggtctgcatgatgtt ttaaaacttg agacaggcca ggtgcggtgg 9180 

10 cttatgcctg taatcccagc actttgggag gccaaggtgg gaggatcact tgaacccggg 924 0 
ggtttgaggc tagcctgggc aacataggga gaccccatct ctacaaataa taatttttaa 9300 
aacatcagcc aggtgtgggg gcatgcacct ctggtcccag ctactcagga ggctgaggca 9360 
ggaggattgc ttgaacctgg gaggttgagg ctgcagtgaa ccgtggttgt gccactgcac 9420 
tccagcctgg gtgacagagt gagaccctta catggtggcc actggctgga gctgagtatc 9480 

15 agtggtccta tttagaaagg ggctgggctt tctggttcat cactgtcecc gccgctcctt 9S4 0 
agtgcttata cctggcccac atcactcatt tctgtcatct gcctggcccc tgtgtagaca 9600 
tttgagtttg aaacccttga ctcaaaggca ggctgatgct ttttgcttcc tctggatcaa 9660 
tggaatttca tccaaggcaa tggaaccagc tcctaattgt gataactttt tgatcatccc 9720 
ctgccactgt gctgaatagg ettatggcca ttaagaagag aggacaggat gaatggtccc 9780 

20 ctgactgact gcccttccag ggtgtttttt tagctgtcag cagaagcatg cggggcagtg 9840 
taccaatcag gtgtcagcaa gtgtcctccc agcgactgag ttgagcacaa aaacatctgc 9900 
tcctggagag acctgagccc tctgaaggcc tcagGqagtt: atgacgttaa cgttctttta 9960 
gaacaaagtg gtacacccat ttgttgtctg gaatgagcac cagaacaaat ctttggccaa 10020 
aaataaattg ttgagggccg ggagcagtgg cttatgcctg taatcccaac actttgggag 10080 

25 gccgaagcag gaggatcacc tgaggtcagg agttcgagac cagcctggcc aacatggtga 1O140 
aaccctgtct ctcctaaaaa tacaaaaatt agctggacgt ggtgacaggt gcctgtaata X0200 
cagctactgg gaggctgagg cagggagaac tgcttgaacc cggaaggtgg aggttgcagt 10260 
gagccggggt catgccactt cactccagcc tgggcaacag agcaaagctc tgtctttaaa 10320 
aacaacttaa ataaataaat aaattgttga ggtctgatga gtaagtggac aagttatttt 10380 

30 ccagcagaca cacaaaagag aaggaaatta caggttatag gaggtatttc agaaaatata 10440 
actttetaaa acataggaag ttgaagaagt tgatcacatt acagaattct gttgtttaga 105 00 
aaatgacctg tggcgaaatg tccttattca gtgaataggt gattccgctt atgcacgacc 10560 
tgtgtgaagt ggatcaggcc acccagaatg cacgatgcgc ttctcaggcc cagcaggagt 1G620 
atgtgtctgt gttaatttcc tgtggctatt atgactaatt gccacaaatg tggtggctta 10680 

35 aaacaacaga aattaatctt cttatagttc tggaagccag aagtttggaa tcaagatgtc 10740 
agcagggcca cactcgctct gatgctctac gggagggtcc tctcttgcct cttccagcgt 10800 
ctggtggctc caggcattcc ataactttat agcagcgtcc cacaaatctc tgcctccatc 10860 
ctcacatggc cttctccact gtgtctctat gtcttcaatc tctttttttt tttttttttt 10920 
ttgaggcagg gtttcactcc agtctcctag actgaagtgc aatggcgtaa tttcgggtca 1O980 

40 ctgcaacctc tgcctcccgg gctcaagcga tttgatctct cctttatctt ataaagatac 11040 
tagtcattgg atttggggct taccctaaat ccaggataat ctcatcttga gatgtttaac lllOO 
ttaattatat ctgcaaacac tgtatttcca aataaggtca tatcacagcc actagggatt 11160 
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jll'JJj agatacttga acatatctta tttgggggct caacccattc cagtgtacga aaaacactct 11220 

tgttcaaggc ccgatgtttc tcagggcata gcccactgac tacctg 11266 



5 <210> 4 

<211> 595 
<212> PRT 
<213> Homo sapiens 

10 <400> 4 

Met Pro Ala Cys Cys Ser Cys Ser Asp Val Phe Gin Tyr Glu Thr Asn 
1 5 10 15 

Lys Val Thr Arg lie Gin Ser Met Asn Tyr Gly Thr lie Lys Trp Phe 
15 20 25 30 

Phe His Val lie lie Phe Ser Tyr Val Cys Phe Ala Leu Val Ser Asp 
35 40 45 

20 Lys Leu Tyr Gin Arg Lys Glu Pro Val lie Ser Ser Val His Thr Lys 
50 55 60 

Val Lys Gly lie Ala Glu Val Lys Glu Glu lie Val Glu Asn Gly Val 

65 ■ 70 75 * 80 

25 I 

Lys Lys Leu Val His Ser Val Phe Asp Thr Ala Asp Tyr Thr Phe Pro 
85 90 95 

Leu Gin Gly Asn Ser Phe Phe Val Met Thr: Asn Phe Leu Lys Thr Glu 
30 100 105 110 

Gly Gin Glu Gin Arg Leu Cys Pro Glu Tyr Pro Thr Arg Arg Thr Leu 
115 120 125 

35 Cys Ser Ser Asp Arg Gly Cys Lys Lys Gly Trp Met Asp Pro Gin Ser 
130 135 140 

Lys Gly lie Gin Thr Gly Arg Cys Val Val His Glu Gly Asn Gin Lys 
145 150 155 160 

40 



Thr Cys Glu Val Ser Ala Trp Cys Pro lie Glu Ala Val Glu Glu Ala 

; 165 170 ■ 175 , 
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pro Arg Pro Ala Leu Leu Asn Ser Ala Glu Asn Phe Thr Val Leu lie 
180 185 190 

5 Lys Asn Asn lie Asp Phe Pro Gly His Asn Tyr Thr Thr Arg Asn lie 
195 200 205 

Leu Pro Gly Leu Asn lie Thr Cys Thr Phe His Lys Thr Gin Asn Pro 
210 215 220 

0 

Gin Cys Pro He Phe Arg Leu Gly Asp He Phe Arg Glu Thr Gly Asp 
22S 230 235 240 

Asn Phe Ser Asp Val Ala He Gin Gly Gly He Met Gly He Glu lie 
5 245 250 255 

Tyr Trp Asp Cys Asn Leu Asp Arg Trp Phe His His Cys Arg Pro Lys 
260 265 270 

0 Tyr Ser Phe Arg Arg Leu Asp Asp Lys Thr Thr Asn Val Ser Leu Tyr 
275 280 285 

Pro Gly Tyr Asn Phe Arg Tyr Ala Lys Tyr Tyr Lys Glu Asn Asn Val 
290 295 300 

•5 ■ ■ ■ .1. 

Glu Lys Arg Thr Leu He Lys Val Phe Gly Tie Arg Phe Asp He Leu 
305 310 315. 320 

Val Phe Gly Thr Gly Gly Lys Phe Asp He He Gin Leu Val Val Tyr 
JO 325 ^ 330 335 

He Gly Ser Thr Leu ser Tyr Phe Gly Leu Ala Ala val Phe He Asp 

340 345 : 350 

35 Phe Leu He Asp Thr Tyr Ser Ser Asn Cys Cys Arg Ser His lie Tyr 
355 360 365 

Pro Trp Cys Lys Cys Cys Gin Pro Cys Val Val Asn Glu Tyr Tyr Tyr 
370 375 380 

40 

Arg Lys Lys Cys Glu Ser He Val Glu Pro Lys Pro Thr Leu Lys Tyr 
385 390 395 400 



100203 GB-2 



-11- 



Val Ser Phe Val Asp Glu Ser His lie Arg Met Val Asn Gin Gin Leu 
405 410 415 

Leu Gly Arg Ser Leu Gin Asp Val Lys Gly Gin Glu Val Pro Arg Pro 
420 425 430 

Ala Met Asp Phe Thr Asp Leu Ser Arg Leu Pro Leu Ala Leu His Asp 
435 440 445 

Thr Pro Pro lie Pro Gly Gin Pro Glu Glu lie Gin Leu Leu Arg Lys 
4 5 0 4 5 5 4 60 

Glu Ala Thr Pro Arg Ser Arg Asp Ser Pro Val Trp Cys Gin Cys Gly 
465 470 475 480 

Ser Cys Leu Pro Ser Gin Leu Pro Glu Ser His Arg Cys Leu Glu Glu 
485 490 495 

Leu Cys Cys Arg Lys Lys Pro Gly Ala Cys lie Thr Thr Ser Glu Leu 
500 505 510 

Phe Arg Lys Leu Val Leu Ser Arg His Val Leu Gin Phe Leu Leu Leu 
515 520 525 

I . • ■ ■" 

Tyr Gin Glu Pro Leu Leu Ala Leu Asp Val Asp Ser Thr Asn Ser Arg 
530 535 540 

Leu Arg His Cys Ala Tyr Arg Cys Tyr Ala Thr Trp Arg Phe Gly Ser 
545 550 555 560 

Gin Asp Met Ala Asp Phe Ala He Leu Pro Ser Cys Cys Arg Trp Arg 
565 570 575 

He Arg Lys Glu Phe Pro Lys Ser Glu Gly Gin Tyr Ser Gly Phe Lys 
580 585 590 



Ser Pro Tyr 
595 
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